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GENETIC SEQUENCES CONFERRING DISEASE 
RESISTANCE IN PLANTS AND USES THEREFOR 

5 The present invention relates generally to genetic sequences, and more particularly to 
genetic sequences which confer or otherwise facilitate disease resistance in plants 
such as against rust and mildew. The present invention further provides for 
transgenic plants carrying the subject genetic sequences enabling the generation of 
disease resistant plants. The present invention is particularly useful for developing 

10 disease resistance in crop varieties. 

Bibliographic details of the publications numerically referred to in this specification 
are collected at the end of the description. Sequence Identity Numbers (SEQ ID 
NOs.) for the nucleotide and amino acid sequences referred to in the specification are 
15 defined following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers 
20 but not the exclusion of any other element or integer or group of elements or 
integers. 

The rapidly increasing sophistication of recombinant DNA technology is greatly 
facilitating the efficacy of commercially important agricultural processes. Of 
25 particular concern is the effect of plant diseases on the efficacy of these agricultural 
processes. Plant diseases and particularly diseases in crop plants represent a major 
contributing factor in crop losses capable of causing economically significant down 
turn in productivity per square metre. The development, therefore, of disease 
resistant plants is an important goal in agricultural and horticultural research. 

30 
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Major genes conditioning resistance to plant diseases have been investigated 
extensively in agriculture. Of particular economic importance are genes controlling 
resistance to rust and mildew. Rust is an especially significant problem amongst 
5 broad acre crops such as wheat, barley and cereal grains and is caused by infection 
with a class of fungi known as the Basidiomycetes. Although rust resistance genes 
are a potentially invaluable genetic resource in agriculture, the molecular basis of 
major gene resistance is unknown and, until the present invention, genes conferring 
rust or mildew resistance had not been cloned. 

10 

Genetic analysis indicates that rust resistance genes control specific recognition of the 
products of rust avirulence genes. In developing disease resistance varieties using 
disease resistance genes, plant breeders deploy resistance genes that match the 
avirulence genes present in the local strains of the pathogens. The pathogen 

15 populations are dynamic and frequently new pathogenic strains arise by any number 
of means such as by mutation, recombination or accidental or natural introduction of 
new pathogenic strains. The existing disease resistant varieties then become 
susceptible to the new pathogenic strains. Plant breeders are then forced to develop 
new disease resistant varieties. At present, breeders use resistance genes that exist in 

20 either wheat or its relatives as their pool of new genes. 

The cloning of a resistance gene is the first step in understanding the basis of 
resistance gene action and in particular the specificity control mechanism. Ellis et al 
(1) have described the development of a transposon tagging system to isolate rust 
25 resistance genes from flax (linseed, Linum usitatissimum). However, despite the 
general effectiveness of the methodology, the isolation of mutant flax plants 
possessing a tagged resistance gene has not been a straight forward procedure (2). 

In accordance with the present invention, genetic sequences conferring rust resistance 
30 have been cloned from flax. The cloning of these sequences provides the means of 
generating transgenic plants with de novo, increased or otherwise enhanced rust 
resistance. The present invention also permits the screening through genetic or 
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immunological means similar rust resistance genes in other plants for use in 
developing or enhancing rust resistance in commercially and economically important 
species. Furthermore, the application of knowledge of the molecular basis behind 
resistance gene action and the specificity thereof offers the potential of a new source 
5 of genes produced by a variety of recombinant techniques. These new genes with 
altered disease resistance specificites are referred to herein as "modular resistance 
genes". 

Accordingly, one aspect of the present invention comprises an isolated nucleic acid 
10 molecule comprising a sequence of nucleotides which confers or otherwise facilitates 
disease resistance in a plant More particularly, the disease resistance is rust 
resistance. The present invention extends, however, to resistance to obligate 
biotrophic pathogens including but not limited to fungi, viruses and nematodes. 

15 Another aspect of the present invention is directed to a nucleic acid molecule which 
comprises a sequence of nucleotides corresponding or complementary to the 
nucleotide sequence set forth in Figure 1 (SEQ ID NO:l) or having at least 45% 
similarity to all or part thereof and wherein said nucleic acid molecule confers or 
otherwise facilitates rust resistance in a plant 

20 

In a related aspect of the present invention there is provided a nucleic acid molecule 
which comprises a sequence of nucleotides encoding or complementary to a sequence 
of nucleotides encoding the amino acid sequence set forth in Figure 1 (SEQ ID 
NOs:2 to 5) or having at least 45% similarity to all or part thereof and wherein said 
25 nucleic acid molecule confers or otherwise facilitates rust resistance in a plant 

Preferably, the percentage similarity to the sequence set forth in SEQ ID NO:l or 
SEQ ID NOs:2 to 5 is at least 50%. Even more preferably, the percentage similarity 
is at least 60%. Still more preferably, the percentage similarity is at least 65%. Yet 
30 still more preferably, the percentage similarity is at least 80-90% including at least 
91% or 93% or 95%. 
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For the purposes of nomenclature, the sequences in Figure 1 relate to the "L6" allele 
of the disease resistance gene L which controls host resistance to flax rust races that 
carry the corresponding avirulence gene AL6. However, the present invention 
extends to other alleles of the gene or alleles of the M gene. Accordingly, a related 

5 embodiment of the present invention contemplates a nucleic acid molecule 
comprising a sequence of nucleotides encoding or complementary to a sequence 
encoding a peptide, polypeptide or protein wherein said peptide, polypeptide or 
protein is capable of interacting with an avirulence gene product on a flax rust. 
Generally, when the rust resistance gene is the L6 allele, the corresponding 

10 avirulence gene is AL6. A similar nomenclature applied to the other alleles of the L 
gene, for example, the L2 and L10 alleles. Their corresponding avirulence genes are 
AL2 and ALIO, respectively. 

A further aspect of the present invention contemplates a nucleic acid molecule which 
IS confers or otherwise facilitates rust resistance in a plant and which is capable of 
hybridising under at least low stringency conditions to the nucleic acid molecule 
defined in SEQ ID NO:l. 

For the purposes of defining the level of stringency, reference can conveniently be 
20 made to Sambrook et al (3), which is herein incorporated by reference, where the 

washing steps at pages 9.52-9.57 are considered high stringency. A "low" stringency 

is defined herein as being in 0.1-0.5% w/v SDS at 37-45°C for 2-3 hours. 

Depending on the source and concentration of nucleic acid involved in the 

hybridisation, alternative conditions of stringency may be employed such as 
25 "medium" stringent conditions which are considered herein to be 0.25-0.5% w/v SDS 

at £45°C for 2-3 hours or "high" stringent conditions as disclosed by Sambrook et al 

(3). 

Although not intending to limit the present invention to any one theory or mode of 
30 action, it is proposed that the genetic sequences of the present invention are necessary 
for specific recognition of the products of rust avirulence genes. Accordingly, the 
genetic sequences are useful in enhancing existing rust resistance genes, providing de 
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novo the required specific recognition of rust avirulence gene products or being 
introduced together with rust avirulence genes on, for example, a single genetic 
cassette. Accordingly, these aspects of the invention are covered by the expression 
"conferring or otherwise enhancing rust resistance" or other similar expression. 

5 

The present invention is particularly directed to resistance conferring or facilitating 
genetic sequences from flax plants and from its relative Linum marginal e. However, 
the subject invention clearly contemplates other sources of rust resistance genes such 
as but not limited to other species of Linum, soybeans, sunflowers and cereals 
10 amongst other plants including wild varieties of wheat, barley and maize. 

The genetic sequences conferring or otherwise facilitating rust resistance may 
correspond to the naturally occurring sequence or may differ by one or more 
nucleotide substitutions, deletions and/or additions. Accordingly, the present 

15 invention extends to rust resistance genes and any functional alleles mutants, 
derivatives, parts, fragments, homologues or analogues thereof or non-functional 
molecules but which are at least useful as, for example, genetic probes or in the 
generation of immunologically interactive recombinant molecules. The present 
invention further extends to the promoter region from the rust resistance genes such 

20 as from L6, L2 or L10. A preferred promoter is from L6. Such promoters may be 
useful in driving expression of transgenes. 

Examples of alleles of the flax rust resistance gene L include, but are not limited to 
L6, L2 and L10 although all alleles of the L locus are encompassed by the present 
25 invention. The present invention also extends to alleles of the M locus. 

The present invention further contemplates recombinant or synthetic rust resistance 
gene products, such as the products of the L6, L2 or L10 alleles of the L resistance 
gene. A particularly preferred recombinant or synthetic rust resistance gene product 
30 is from the L6 gene. The L6 protein has the following features; an amino terminal 
half containing a nucleotide binding site (NBS) consisting of several motifs including 
the P-loop and further downstream, a kinase-2 motif (14). In L6, the kinase-2 
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sequence is ILWLDDVD (SEQ ID NO: 13) and more generally, XXXXDDX where 
X=L4,V,F (using single amino acid nomenclature defined below). The C-tenninal 
half of the polypeptide is a leucine rich region consisting of approximately 17% 
leucine residues. Within this region, stretches of residues appear where leucine is 
5 reiterated LXXL or LXL where X = any residue. These sequences are similar to the 
leucine-rich repeat sequences of 25-30 residues that are found in many proteins that 
form protein-protein interactions. More particularly in the L6 product, the leucine 
rich region of L6 consists of two parts with the C-tenninal part of the region 
consisting of two direct leucine rich repeats of 146 and 149 amino acids that are 74% 
10 identical. 

The above mentioned functional mutants, derivatives, parts, fragments, homologies 
and analogues of are referred to herein as "rust resistance-like genes" or "rust 
resistance genetic sequences". Reference herein to "genes" is to be taken in its 

15 broadest context and includes a classical genomic gene as well as mRNA or cDNA 
corresponding to the coding regions (i.e. exons) of the gene. The term "gene" is also 
used to describe synthetic or fusion molecules encoding all or part of a functional 
product. Preferred rust resistance-like genes are derived from a naturally occurring 
rust resistance gene by standard recombinant techniques. Generally, a rust resistance 

20 gene may be subjected to mutagenesis to produce single or multiple nucleotide 

substitutions, deletions and/or additions. Nucleotide insertional derivatives of the rust 
resistance gene of the present invention include 5' and 3' terminal fusions as well as 
intra-sequence insertions of single or multiple nucleotides. Insertional nucleotide 
sequence variants are those in which one or more nucleotides are introduced into a 

25 predetermined site in the nucleotide sequence although random insertion is also 
possible with suitable screening of the resulting product. Deletional variants are 
characterised by the removal of one or more nucleotides from the sequence. 
Substitutional nucleotide variants are those in which at least one nucleotide in the 
sequence has been removed and a different nucleotide inserted in its place. Such a 

30 substitution may be "silent" in that the substitution does not change the amino acid 
defined by the codon. Alternatively, substituents are designed to alter one amino 
acid for another similar acting amino acid. Typical substitutions are those made in 
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accordance with the following: 



10 



15 



20 



Suitable residues for amino acid substitutions 
Original Residue Exemplary Substitutions 



Ala 


Ser 


ATg 


Lvs 


A e«% 


Gin; His 


A cn 


Glu 




Ser 


vjin 


Asn 


VJIU 


Asd 


uiy 


Ala 


His 


Asn; Gin 


lie 


Leu: Val 


Leu 


lie; vai 


Lys 


Arg; Gin; 


Met 


Leu; He 


Phe 


Met; Leu; 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



25 In a particularly preferred embodiment, the rust resistance genetic sequences or like 
genetic sequences are employed to identify and isolate similar genes from other 
plants. According to this embodiment, there is contemplated a method for 
identifying a rust resistance genetic sequence or rust resistance-like genetic sequence 
in a plant, said method comprising contacting genomic DNA or cDNA isolated from 

30 said plant with a hybridisation effective amount of a genetic sequence conferring or 
otherwise rust resistance or part thereof and then detecting said hybridisation. 
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Preferably, the latter mentioned genetic sequence is from flax or similar plant such as 
a Linum species. In a most preferred embodiment, the latter genetic sequence is as 
set forth in SEQ ID NO:l or corresponds to a probe such as defined in Figure 2 
(SEQIDNO:ll). 

5 

Preferably, the latter genetic sequence is labelled with a reporter molecule capable of 
giving an identifiable signal (e.g. a radioisotope such as J P or S or a biotintylated 
molecule). 

10 Preferably, the plant to be screened is a flax, a Linum species or a grain crop plant 
such as wheat, barley, maize, rye, lupins or rice and/or wild varieties of same. 

Alternatively, the plant is screened using antibodies to a recombinant product of a 
rust resistance gene. According to a first aspect of this embodiment, the present 

15 invention provides for the expression of the subject genetic sequence in a suitable 
host (e.g. a prokaiyote or eukaryote) to produce full length or non-full length 
recombinant rust resistance gene products. The only requirement being that the 
recombinant products are immunologically interactive with antibodies to all or part of 
a rust resistance gene product The immunological screening for rust resistance gene 

20 products may, in some circumstances, be more suitable than a genetic screening 
procedure. 

Another aspect of the present invention is, therefore, directed to antibodies to a 
recombinant rust resistance gene product or part or fragment thereof. Such 

25 antibodies may be monoclonal or polyclonal and may be selected from naturally 

occurring antibodies to a rust resistance gene product or may be specifically raised to 
a recombinant rust resistance gene product. In the case of the latter, the rust 
resistance gene product may first need to be associated with a carrier molecule. 
Alternatively, fragments of antibodies may be used such as Fab fragments. 

30 Furthermore, the present invention extends to recombinant and synthetic antibodies 
and to antibody hybrids. A "synthetic antibody" is considered herein to include 
fragments and hybrids of antibodies. In additions, antibodies may be raised to 
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fragments or derivatives of the rust resistance gene product such as oligopeptides 
derived from or based on the product of, for example, the L6 gene. The antibodies 
and/or the recombinant rust resistance gene products of the present invention are 
particularly useful for the immunological screening of rust resistance gene products in 
5 various plants, in monitoring expression of rust resistance genetic sequences in 
transgenic plants and as a proprietary tagging system. 

In one embodiment, specific antibodies are used to screen for rust resistance gene 
products in plants. Techniques for the assays contemplated herein are known in the 
10 art and include, for example, sandwich assays and ELISA. 

It is within the scope of this invention to include any second antibodies (monoclonal, 
polyclonal or fragments of antibodies) directed to the first mentioned antibodies 
discussed above. Both the first and second antibodies may be used in detection 
15 assays or a first antibody may be used with a commercially available anti- 
immunoglobulin antibody. An antibody as contemplated herein includes any 
antibody specific to any region of a recombinant rust resistance gene product. 

Both polyclonal and monoclonal antibodies are obtainable by immunisation with a 
20 recombinant rust resistance gene product and either type is utilisable for 

immunoassays. The methods of obtaining both types of sera axe well known in the 
art Polyclonal sera are less preferred but are relatively easily prepared by injection 
of a suitable laboratory animal with an effective amount of recombinant rust 
resistance gene product, or antigenic or immunointeractive parts thereof, collecting 
25 serum from the animal and isolating specific sera by any of the known 

immunoadsorbent techniques. Although antibodies produced by this method are 
utilisable in virtually any type of immunoassay, they are generally less favoured 
because of the potential heterogeneity of the product 

30 The use of monoclonal antibodies in an immunoassay is particularly preferred 

because of the ability to produce them in large quantities and the homogeneity of the 
product The preparation of hybridoma cell lines for monoclonal antibody production 
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derived by fusing an immortal cell line and lymphocytes sensitised against the 
immunogenic preparation can be done by techniques which are well known to those 
who are skilled in the art (see, for example, references 4, 5 and 6). 

5 The presence of a rust resistance gene product in a plant or more commonly plant 
extract may be accomplished in a number of ways such as by Western blotting and 
. ELISA procedures. A wide range of immunoassay techniques are available as can 
be seen by reference to US Patent Nos. 4,016,043, 4, 424,279 and 4,018,653. These, 
of course, includes both single-site and two-site or "sandwich" assays of the non- 

10 competitive types, as well as in the traditional competitive binding assays. These 
assays also include direct binding of a labelled antibody to a target 

Sandwich assays are among the most useful and commonly used assays and are 
favoured for use in the present invention. A number of variations of the sandwich 

IS assay technique exist, and all are intended to be encompassed by the present 

invention. Briefly, in a typical forward assay, an unlabelled antibody is immobilised 
on a solid substrate and the sample to be tested brought into contact with the bound 
molecule. After a suitable period of incubation, for a period of time sufficient to 
allow formation of an antibody-antigen complex, a second antibody specific to the 

20 antigen, labelled with a reporter molecule capable of producing a detectable signal is 
then added and incubated, allowing time sufficient for the formation of another 
complex of antibody-antigen-labelled antibody. Any unreacted material is washed 
away, and the presence of the antigen is determined by observation of a signal 
produced by the reporter molecule. 

25 

In this case, the first antibody is raised to a recombinant rust resistance gene product 
and the antigen is a rust resistance gene product in a plant 

The results may either be qualitative, by simple observation of the visible signal, or 
30 may be quantitated by comparing with a control sample containing known amounts 
of hapten. Variations on the forward assay include a simultaneous assay, in which 
both sample and labelled antibody are added simultaneously to the bound antibody. 
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These techniques are well known to those skilled in the art, including any minor 
variations as will be readily apparent In accordance with the present invention the 
sample is one which might contain rust resistance gene product and include crude or 
purified plant extract such as extracts of leaves, roots and stems. 

5 

In the typical forward sandwich assay, a first antibody raised against a recombinant 
rust resistance gene product is either covalently or passively bound to a solid surface. 
The solid surface is typically glass or a polymer, the most commonly used polymers 
being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or 

10 polypropylene. The solid supports may be in the form of tubes, beads, discs of 
microplates, or any other surface suitable for conducting an immunoassay. The 
binding processes are well-known in the art and generally consist of cross-linking, 
covalent binding or physically adsorption, the polymer-antibody complex is washed 
in preparation for the test sample. An aliquot of the sample to be tested is then 

15 added to the solid phase complex and incubated for a period of time sufficient (e.g. 
2-40 minutes) and under suitable conditions (e.g. 25°C) to allow binding of any 
antigen present in the sample to the antibody. Following the incubation period, the 
reaction locus is washed and dried and incubated with a second antibody specific for 
a portion of the first antibody. The second antibody is linked to a reporter molecule 

20 which is used to indicate the binding of the second antibody to the hapten. 

An alternative method involves immobilising the target molecules in the biological 
sample and then exposing the immobilised target to specific antibody which may or 
may not be labelled with a reporter molecule. Depending on the amount of target 
25 and the strength of the reporter molecule signal, a bound target may be detected by 
direct labelling with the antibody. Alternatively, a second labelled antibody, specific 
to the first antibody is exposed to the target-first antibody complex to form a target- 
first antibody-second antibody tertiary complex. The complex is detected by the 
signal emitted by the reporter molecule. 



30 
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By "reporter molecule" as used in the present specification, is meant a molecule 
which, by its chemical nature, provides an analytically identifiable signal which 
allows the detection of antigen-bound antibody. Detection may be either qualitative 
5 or quantitative. The most commonly used reporter molecules in this type of assay 
are either enzymes, fluorophores or radionuclide containing molecules (i.e. 
radioisotopes) and chemiluminescent molecules. 

In the case of an enzyme immunoassay, an enzyme is conjugated to the second 

10 antibody, generally by means of glutaraldehyde or periodate. As will be readily 
recognised, however, a wide variety of different conjugation techniques exist, which 
are readily available to the skilled artisan. Commonly used enzymes include 
horseradish peroxidase, glucose oxidase, beta-galactosidase and alkaline phosphatase, 
amongst others. The substrates to be used with the specific enzymes are generally 

15 chosen for the production, upon hydrolysis by the corresponding enzyme, of a 

detectable colour change. Examples of suitable enzymes include alkaline phosphatase 
and peroxidase. It is also possible to employ fluorogenic substrates which yield a 
fluorescent product rather than the chromogenic substrates noted above. In all cases, 
the enzyme-labelled antibody is added to the first antibody-hapten complex, allowed 

20 to bind, and then the excess reagent is washed away. A solution containing the 

appropriate substrate is then added to the complex of antibody-antigen-antibody. The 
substrate will react with the enzyme linked to the second antibody, giving a 
qualitative visual signal, which may be further quantitated, usually 
spectrophotometrically, to give an indication of the amount of hapten which was 

25 present in the sample. The term "reporter molecule" also extends to use of cell 

agglutination or inhibition of agglutination such as red blood cells on latex beads, and 
the like. 

Alternately, fluorescent compounds, such as fluorescein and rhodamine, may be 
30 chemically coupled to antibodies without altering their binding capacity. When 
activated by illumination with light of a particular wavelength, the fluorochrome- 
labelled antibody adsorbs the tight energy, inducing a state to excitability in the 



F 
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molecule, followed by emission of the light at a characteristic colour visually 
detectable with a light microscope. As in enzyme immunoassays (HA), the 
fluorescent labelled antibody is allowed to bind to the first antibody-hapten complex. 
After washing off the unbound reagent, the remaining tertiary complex is then 
5 exposed to the light of the appropriate wavelength the fluorescence observed indicates 
the presence of the hapten of interest. Immunofluorescene and EIA techniques are 
both very well established in the art and are particularly preferred for the present 
method. However, other reporter molecules, such as radioisotope, chemiluminescent 
or bioluminescent molecules, may also be employed. 

10 

It will be readily apparent to the skilled technician how to vary the above assays and 
all such variations are encompassed by the present invention. 

The present invention is particularly described with reference to the flax rust 
15 resistance gene L and its alleles L6, L2 and LI 0. This is done, however, with the 
understanding that the subject invention extends to a range of resistance genes and 
alleles thereof for rust and other pathogens. In fact, the present invention extends to 
a resistance gene characterised by said gene encoding a product having at least one 
leucine rich region. Preferably, the leucine rich region is a leucine rich repeat at the 
20 3' end of the molecule and has at least 60% similarly to each other more preferably 
at least 70% similarity and even more preferably at least 80% similarity. Still more 
preferably, the leucine rich region corresponds to the following amino acid sequence: 

PDLIELLPCELGGQTW . VPSMAELTIRDCPRLEVGPMIRSLPKFPMLKK 
25 LDLAVANITKEEDLDAIGSLEELVSLELELDDTSSGIERIVSSSKLQKLT 

TLWKVPSLREIEGLEELKSLQDLYLEGCTSLGRL PLEKLKE 

LDIGGC (SEQ ID NO: 12) 

or having at least 60% similarly thereto or more preferably at least 70% similarity or 
30 even more preferably at least 80% similarity thereto. Alternatively or in addition to, 
the rust resistance gene further encodes a p-Loop at its 5' end encoding the amino 
acid sequence: 

GXXXXGKT/S (SEQ ID NO:8), 
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where X is any amino acid residue. More preferably, the p-Loop sequence comprises 
the amino acid residues: 

GMGGIGKTT (SEQ ID NO:9), 

and more particularly 

5 GLYGMGGIGKTT (SEQ ID NO: 10), 

or having one or more amino acid substitutions, insertions and/or deletions thereto 
provided that such derivatives still function as a p-Loop. A p-Loop is involved in 
ATP/GTP binding. It is proposed herein that copy number and amino acid sequence 
of this repeated element are the determinants of the gene-for-gene specificity of rust 

10 resistance genes. Mixing and matching of these elements from existing genes will 
provide a potential means for creating new and useful resistance gene specificities for 
use in plant breeding. Such new genes are referred to herein as "modular resistance 
genes". This is a completely novel approach to disease resistance breeding. 

IS According to this embodiment, there is contemplated a modular resistance gene 

characterised in that said gene encodes a non-naturally occurring leucine rich region. 
By "non-naturally occurring" is meant to include the manipulation of a genetic 
sequence to introduce a leucine rich encoding sequence, to increase the copy number 
of an existing leucine rich encoding sequence, to delete or insert a nucleotide 

20 sequence for or into a leucine rich encoding sequence or to otherwise modify a 
leucine rich encoding sequence to modulate pathogen specificity of a disease 
resistance gene. For example, the leucine rich regions of two or more of alleles of 
the L locus such as L6, L2 and L10 could be combined to broaden the range of 
pathogens to which a gene can confer resistance. Alternatively, combinations of 

25 allels from the L and M loci could be made. The present invention contemplates, 
therefore, a method of producing a modular resistance gene conferring resistance to a 
pathogen in a plant by manipulating the leucine rich regions in a disease resistance 
gene. 
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The present invention further extends to transgenic plants such as transgenic crop 
plants (e.g. wheat, barley or maize) carrying a non-indigenous genetic sequence 
conferring or otherwise facilitating rust resistance in said plant Preferably, the non- 

5 indigenous genetic sequence is from a closely related species to the transgenic plant 
The expression of the non-indigenous genetic sequence may be constitutive or 
inducible or developmentally regulated. Furthermore, the non-indigenous sequence 
may be inserted into or fused to a particular endogenous genetic sequence. In a most 
preferred embodiment, the transgenic plant is wheat and the non-indigenous genetic 

10 sequence is from a wild variety of wheat, barley, maize, flax or Linum species. 
Other transformed species or resistance gene sources are not excluded. 

The present invention is further described by reference to the following non-limiting 
Figures and Examples. 

15 

In the Figures: 

Figure 1 is a representation of the nucleotide sequence (SEQ ID NO:l) and 
corresponding amino acid sequence (SEQ ID NOs:2 to 5) of the rust resistance gene 

20 L6. There are two predicted products, the "full length" and "truncated" products. 
The truncated product results from an alternately spliced mRNA that retains intron 3. 
In the absence of the splicing of this intron, translation would continue through the 
82bp intron 3 as a consequence of not changing frames, a stop codon is reached just 
downstream of the end of the intron. This results in a product from which the major 

25 portion of the leucine rich region is removed and also the addition of an extra 29 
amino (see underlined amino acids [SEQ ID NOs: 6 and 7) that are not in the full 
length product. 

Figure 2 is a representation of part of the L6 gene showing the LU-1 probe 
30 (underlined) and Ac insertion in mutant X75 (SEQ ID NO:l 1). 
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Figure 3 is a schematic representation of the L6 gene showing location of LU-2 
probe. 

5 Figure 4 is a representation of the amino acid sequences of the leucine rich repeat in 
L6. 

Figure 5 is a schematic representation of the L locus alleles. 

10 EXAMPLE 1 

PLANT MATERIAL 
Experiments were conducted in the line of flax called "Forge" which is homozygous 
for the rust resistance genes L 6 , M, N and P 2 (2). One particular flax is designated 
TC257 and is a primary transformant of "Forge" possessing 10 copies of the maize 

15 transposable element Ac (1). Rust gene cultivars are referred to as "Birio" (L 6 ), 
"Dakota" (M), "Bombay" (N) and "Abyssinian" (P 2 ) (8). 

EXAMPLE 2 

RUST STRAINS 

20 Four rust strains were used to screen for mutants. Three of these strains, designated 
CH5-78, CH5-84 and CH5-133, were obtained by selfing strain CHS. The fourth 
strain designated C was a parent strain of CHS. The origins of C and CHS have 
been previously described (7). Each of these strains is avirulent on one of the single 
gene differentials and virulent on the other three. Strain CH5-84 is avirulent on 

25 "Birio" (L 6 ), CH5-78 is avirulent on "Dakota" (M), C is avirulent on "Bombay" (N) 
and CHS* 133 is avirulent on "Abyssinian" (P 2 ). Rust maintenance and inoculation 
procedures were as previously described (8). 
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EXAMPLE 3 

TRANSFORMATION 
Flax cotyledons were transformed with either the vector pKU3 (9), pBT175 (10), 
5 pB135SAcll, 12 (11) or an Ac element in which the OCS enhancer (12) had been 
inserted at the BamHl site at position 182, according to the protocol previously 
described (13). 

EXAMPLE 4 

10 TAGGING SCHEME 

The tagging scheme used in this study has been described by Ellis et al (1) and 
Lawrence et al (13). "Forge" plants with and without Ac elements, including TC257 
and its selfed progeny, were extensively crossed as the female parent with 
"Hoshangabad" to produce hybrid seed heterozygous for the four rust resistance 

IS genes. When seedlings were about 5cm tall, the tip was cut off and the two lateral 
shoots that subsequently developed were inoculated with a mixture of die four rust 
strains, each of which recognise a single resistance gene. Most plants were resistant 
to all four rust strains. Susceptible mutants were of three types, namely, "whole", in 
which all leaves on both lateral shoots were susceptible, "bisectored", in which all 

20 leaves on one lateral were susceptible while the other shoot was entirely resistant and 
"mini-sectored", in which only some of the leaves on one shoot were susceptible. 
When rare rust-susceptible mutants were detected, the mutant gene was identified by 
recovering rust spores and inoculating these on to a set of four rust resistance genes 
L 6 , M, N or P 2 . If, for example, the rust recovered from a susceptible mutant plant 

25 grew on the M, N and P 2 differentials but not on the L 6 differential, then this 
indicated that the plant was mutant for the L 6 gene. 

EXAMPLE 5 

DETECTION OF TRANSPOSED Ac ELEMENTS IN MUTANT PLANTS 
30 The procedure for identifying transposed Ac elements was as previously described 
(2). 
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EXAMPLE 6 

LAMBDA CLONING 
Flax DNA was digested with fiamHI or BgRl and the unfractionated DNA was 
cloned into the Lambda vector EMBL4 as previously described (3). 

5 

EXAMPLE 7 

fiENFTIC ANALYSIS 
Genetic analysis including RFLP techniques, PCR analysis, Southern blot analysis 
and linkage analysis were as previously described (1; 2). The nucleotide and 

10 corresponding amino acid sequence for the L6 gene is shown in Figure 1. There are 
two predicted products, the "full length" and "truncated" products. The truncated 
product results from an alternately spliced mRNA that retains intron 3. In the 
absence of the splicing of this intron, translation would continue through the 82bp 
intron 3 and as a consequence of not changing frames, a stop codon is Teached just 

15 downstream of the end of the intron. This results in a product from which the major 
portion of the leucine rich region is removed and also the addition of an extra 29 
amino acids (see underlined amino acids in Figure 1) that are not in the full length 
product. 

20 EXAMPLE 8 

IDENTIFICATION AND CHARACTERISATION OF A RUST-SUSCEPTIBLE 
MUTANT PLANT IN WHICH THE L6 GENE HAS BEEN TAGGED BY Ac 
The TC257 (Example 1) line of transgenic flax containing at least 10 copies of Ac 
(1; 2), one of which was linked (29 map units) from the L6 rust resistance gene, was 
25 crossed extensively to a line without resistance genes and the progeny were screened 
for lack of L6 activity. One rust susceptible mutant lacking L6 specificity (designated 
mutant X75) contained a transposed Ac element which, from experimental evidence, 
is putatively located in the L6 gene. 
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EXAMPLE 9 

X75 CONTAINS A TRANSPOSED AC CLOSELY LINKED TO THE 

MUTANT L6 GENE 

5 The L6 mutant X75 contains a transposed Ac (referred to herein as n Tr-Ac") that is 
not present in its resistant parent The location of this element was mapped with 
respect to the L6 gene. A joint segregation analysis of Tr-Ac and the mutant L6 gene 
demonstrated tight linkage of these two characters. X75 was crossed to cultivar 
"Birio" (L6) and progeny that inherited Tr-Ac were identified by Southern analysis. 

10 Two progeny plants, D766 and D769, were test-crossed to the cultivar 

"Hoshangabad" (no L6). Thirty six progeny were tested for L6 rust resistance and 
scored for the presence of Tr-Ac. Eighteen of the progeny were susceptible and these 
all possessed Tr-Ac while the remaining 18 were resistant and these all lacked Tr-Ac. 
This is the result expected if the allele for susceptibility is an L6 gene inactivated by 

15 Tr-Ac. 

The insertion site of Tr-Ac was also mapped by restriction fragment length 
polymorphism (RFLP) analysis in a family of 88 test-cross progeny in which L6 was 
segregating. A fragment of flax DNA (probe LU-1) isolated from a lambda clone 

20 containing the 3' junction between Tr-Ac and flax DNA, detected 3 RFLPs that 
distinguished the two parents when their DNA was cut with £coRI, BgUl and XbaL 
The analysis of the segregation of these three markers and the L6 gene in the test- 
cross demonstrated that all four markers were closely linked. A single recombinant 
individual involving L6 and the RFLP marker detected in Xbal digested DNA was 

25 detected among the 88 progeny. As detailed below, this recombination event present 
in progeny plant D237 occurred within the region of the L6 gene. 
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EXAMPLE 10 

RECOMBINATION IN THE VICINITY OF THE Ac INSERTION SITE 
ALTERS THE RESISTANCE REACTION OR SPECIFICITY OF L6 
5 The RFLP probe LU-1 was used to establish a restriction map of the homologous 
regions of the resistant and susceptible parents of the test cross family and of the 
recombinant progeny plant D237. Three polymorphic restriction sites were detected 
and comparison of the maps demonstrated that a cross-over had occurred within a 
region of about 3kbp on either side of the Ac insertion site. 

10 

The recombinant plant D237 was selfed and 16 progeny were screened with a rust 
isolate that recognises L6. The progeny and were also analysed by Southern blotting 
to detect the recombinant chromosome. The progeny segregated for rust reaction: 6 
were fully susceptible while 10 were partially resistant with restricted rust growth 
15 confined to the younger leaves. This result is consistent with the segregation of a 
novel gene for partial resistance which was not present in the original Forge parent 
There was a complete association between the second class with novel rust reaction 
and the presence of the recombinant chromosome. 

20 Thus, crossing over in the vicinity of the L6 gene results in an alteration of the L6 
resistance phenotype. 

EXAMPLE 11 

INDEPENDENT L6 MUTANTS CONTAIN INSERTIONS 
25 IN THE REGION OF 0.5 TO 3.0KB OF THE Ac INSERTION SITE 

Thirty four independent L6 mutants were isolated in the screen for tagged L6 genes. 
All but X75 were either deletions or, if not deletions, contained no transposed Ac 
elements (2). Analysis of mutants in the latter class using the LU-1 probe revealed 
thrt two of them PGA and XI 17) contained a small (approximately 200bp) insertion 
30 in this region. In the case of XI 17, the mutant was compared to its resistant parent to 
confirm that the polymorphism was not pre-existing. In the case of X3 A, the 
insertion and loss of L6 had occurred somatically. Mutant X3A arose as a sector. 
Among two shoots on a single Fl plant, one (X3A) had lost L6 and the second, X3B 
was wild-type for L6. DNA from the two sectors was examined using the LU-1 
35 probe. An insertion was observed only in DNA from the mutant sector. 
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EXAMPLE 12 

EXCISION OF TR-AC IN PROGENY OF X7S IS ASSOCIATED 
WITH REVERSION TO RUST RESISTANCE 
The X75 mutant was selfed and progeny that were homozygous for the transposed Ac 
5 element were identified by Southern blotting. Selfed progeny of four such plants 
were in turn screened with a rust strain that recognises the L6 resistance specificity. 
Thirty seven resistant plants were identified among 3105 progeny. 15 of the 
revertants were examined by either PCR or Southern analysis for an excision 
fragment that would result from the excision of Ac from the L6 region. The excision 
10 marker occurred in all the plants. In two cases, the excision region was sequenced 
and small base changes, (an "Ac footprint") in the 8bp repeat that flanked Ac in X75 
resulted from the excision process. 

EXAMPLE 13 

15 A DNA PROBE ADJACENT TO Ac DETECTS A SECOND LOCUS 

IN FLAX. CLOSELY LINKED TO THE M RESISTANCE LOCUS 
A second DNA probe, LU-2 was isolated from the same lambda clone as LU-1. 
These two fragments are separated by about 2kb. LU-2, like many other flax DNA 
probes, hybridises to two genomic fragments in most restriction digests, providing 

20 molecular evidence for the tetraploid nature of flax. In Xba\ digests, LU-2 detects 
two polymorphic markers, LU-2-1 and LU-2-2 that distinguish the resistant and 
susceptible parents of a test-cross family of 52 progeny among which the L6 and M 
resistance genes segregate. Joint segregation analysis of these two RFLP markers and 
the L6 and M resistance genes demonstrated complete linkage of LU-2-1 with L6 and 

25 LU-2-2 with the resistance gene M. Thus, cloning the L6 locus has allowed access to 
the M resistance gene region. The LU-2 probe and A flanked Ac in X75 in L6 is 
shown in Figures 2 and 3. 

EXAMPLE 14 

30 RUST RESISTANCE GgNES FOfiM A MULUGFNE FAMILY IN FLAX 

The probe LU-1 and several adjacent restriction fragments from a lambda clone were 
used as hybridisation probes to isolate a cDNA from a library generated using leaf 
mRNA. A cDNA clone and several other probes from the L6 gene have been used as 
probes for genomic Southern analysis. These probes generally detect multiple 
35 fragments in Southern blots which indicates that flax contains a family of genes of 
similar sequence. All RFLPs detected with this probe map to either the L6 or M 
resistance gene regions. 
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EXAMPLE 15 

PROBE LU-1 IDENTIFIES ALLELES OF L6 
In genetic studies, the L locus of flax behaves as a single gene with multiple allelic 
specificities. Twelve resistance specificities map to the L locus. The probe LU-1 was 
5 used to examine a differential set of flax plants that each contain a single L allele 
(see Figures 2 and 3). DNA from the 12 differentials and from "Hoshangabad" which 
carries a "null" allele at the L locus was cut with Xbdl, BglU and EcoRl. In each 
case, a single fragment was detected in the L differentials and each L allele could be 
distinguished from the null allele of "Hoshangabad". Using these three enzymes, 10 
10 RFLP genotypes were detected. Only two pairs of L alleles, L3 and L4 and L6 and 
L7, could not be distinguished with this limited sample of digests. 

EXAMPLE 16 

A cDNA FROM FLAX DETECTS SEQUENCE SIMILARITY IN 
15 A WILD LINUM SPECIES 

Southern analysis of DNA from the wild species, Linum morginole which contains 
genes controlling resistance against flax rust, detected strongly hybridising fragments. 

EXAMPLE 17 

20 DETERMINATION OF AMINO ACID SEQUENCE OF L6 

The amino acid sequence of the L6 cDNA was determined and is shown in Figure 1. 
The amino acid sequence is shown in single letter code, single letter abbreviations for 
amino acids are defined below: 
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Amino Acid One-letter Symbol 

Alanine A 

5 Arginine R 

Asparagine N 

Aspartic acid D 

Cysteine C 

Glutaxnine Q 

10 Glutamic acid £ 

Glycine G 

Histidine H 

Isoleucine I 

Leucine L 

15 Lysine K 

Methionine M 

Phenylalanine F 

Proline P 

Serine S 

20 Threonine T 

Tryptophan W 

Tyrosine Y 

Valine V 

Amino Acid X 
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EXAMPLE 18 

COMPARISON OF AMINO ACID SEQUENCE OF 
LEUCINE RICH REPEAT FROM L6 

The L6 allele comprises a leucine rich repeat at amino acid residues 969 to 11 15 and 
1 1 16 to 1265. This repeat was compared for identity and the comparison is shown in 
Figure 4. Details of the comparisons are shown below: 



10 Length: 156 amino acids 

Gaps: 3 

Gap weight: 3.000 

Length weight: 0.100 

Quality: 160.4 

15 Ratio: 1.091 

Average match: 0.540 

Average mismatch: -0.396 

Percent similarity: 82.979% 

Percent identity: 74.468 

20 

EXAMPLE 19 

CLONING AND COMPARISO N OF L LOCUS ALLELES 



Using the clones L6 gene as a probe (probe LU-1), the L2 and L10 alleles were 
25 cloned on EcoRl DNA fragments. These alleles have different resistance specificities 
and control resistance to flax rust strains that cany the AL2 and ALIO avirulence 
genes, respectively. 

The L2 and L10 clones were subject to restriction endonuclease analysis and a 
30 schematic represented is shown in Figure 5. The L2 EcoRI fragment is about 400bp 
longer than the corresponding L10 fragment. DNA sequence analysis of both ends of 
the L2 and L10 fragments revealed practically identical sequence (>90%) and 
indicates that L2 differs from L10 by about 400bp of extra DNA. Furthermore, 
sequence analysis and restriction fragment mapping from internal restriction sites 
35 confirmed the extra DNA and indicated that the 5' halves of L2, L6 and LI 0 are very 
similar (>90% identical) and that the differences between the alleles occurs in the 3' 
half of the gene. The difference in gene length occurs in a region coding for the 
leucine rich repeat structure of 156 amino acids that has two copies in L6 (Figure 4) 
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and only one copy in a cDNA from a related but non-allelic gene called FC4. 

Those skilled in the art will appreciate that the invention described herein is 
susceptible to variations and modifications other than those specifically described. It 
5 is to be understood that the invention includes all such variations and modifications. 
The invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and 
all combinations of any two or more of said steps or features. 
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(in) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(Q OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT INTERNATIONAL 

(B) FILING DATE: 21-APR-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PM523 1/94 (AU) 

(B) FILING DATE: 21-APR-1994 
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(viii) ATTORNEY/AGENT INFORMATION: 
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(C) REFERENCE/DOCKET NUMBER: EJH/EK 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 254 2777 

(B) TELEFAX: +61 3 254 2770 



WO 95/29238 



-28- 



PCT/AU95/00240 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 163.. 7B9 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1097.. 2203 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2669.. 4525 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2293.. 2586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GAGCTCAGAA GTAAGGGATA AGGCAAGAAG CAGAGGAGCA GAGAGAAGAA CCAGCAAAGC 60 

ACATGCAAAT TGAAGCAGGC AAGAACAGTT ACTGGAAATT CATTTATCTC TGCTTTCAAT 120 

TTCTATCCTT CAGATCATTT CTGCTCAATT GAATCACTAG TC ATG AGT TAT TTG 174 

Met Ser Tyr Leu 
1 

AGA GAA GTT GCT ACT GCT GTT GCC TTG CTT CTC CCT TTC ATT CTT CTC 222 
Arg Glu Val Ala Thr Ala Val Ala Leu Leu Leu Pro Phe He Leu Leu 
5 10 15 20 

AAC AAG TTT TGG ADA CCA AAT TCC AAA GAC TCA ATC GTC AAC GAT GAT 270 
Asn Lys Phe Trp Arg Pro Asn Ser Lys Asp Ser He Val Asn Asp Asp 

25 30 35 

GAC GAT TCA ACA TCT GAA GTT GAT GCC ATA TCC GAC TCC ACA AAT CCC 318 
Asp Asp Ser Thr Ser Glu Val Asp Ala He Ser Asp Ser Thr Asn Pro 
40 45 50 

TCT GGT TCA TTT CCC TCC GTG GAG TAT GAA GTG TTT TTG AGT TTC AGG 366 
Ser Gly Ser Phe Pro Ser Val Glu Tyr Glu Val Phe Leu Ser Phe Arg 
55 60 65 



GGT CCA GAT ACT CGT GAA CAG TTC ACC GAT TTC CTA TAT CAG TCT CTC 
Gly Pro Asp Thr Arg Glu Gin Phe Thr Asp Phe Leu Tyr Gin Ser Leu 
70 75 80 



414 
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CGT CGC TAT AAG ATT CAC ACT TTT AGG GAC GAC GAT GAG CTA CTC AAA 462 
Arg Arg Tyr Lys lie His Thr Phe Arg Asp Asp Asp Glu Leu Leu Lys 
85 90 95 100 

GGA AAA GAA ATA GGG CCC AAC CTC CTA CGA GCA ATT GAT CAG TCC AAA 510 
Gly Lys Glu lie Gly Pro Asn Leu Leu Arg Ala He Asp Gin Ser Lys 
105 HO US 

ATT TAC GTC CCG ATC ATA TCG AGC GGA TAT GCT GAT AGT AAG TGG TGT 55 B 

He Tyr Val Pro He He Ser Ser Gly Tyr Ala Asp Ser Lys Trp Cys 
120 125 130 

CTT ATG GAG CTT GCT GAA ATT GTG AGA CGT CAA GAG GAG GAC CCT CGA 606 
Leu Met Glu Leu Ala Glu He Val Arg Arg Gin Glu Glu Asp Pro Arg 
135 140 145 

CGC ATC ATA CTT CCT ATT TTT TAT ATG GTG GAT CCA AGT GAC GTA CGA 654 
Arg He He Leu Pro He Phe Tyr Met Val Asp Pro Ser Asp Val Arg 
150 155 160 

CAT CAG ACT GGA TGT TAT AAA AAA GCA TTT CGA AAA CAC GCA AAT AAA 702 
His Gin Thr Gly Cys Tyr, Lys Lys Ala Phe Arg Lys His Ala Asn Lys . 
165 170 175 1B0 

TTT GAT GGA CAG ACC ATA CAA AAC TGG AAA GAT GCT CTA AAG AAG GTC 750 
Phe Asp Gly Gin Thr He Gin Asn Trp Lys Asp Ala Leu Lys Lys Val 
185 190 195 

GGA GAC TTA AAA GGA TGG CAC ATC GGA AAG AAT GAC AAG TATGTAATCC 799 
Gly Asp Leu Lys Gly Trp His He Gly Lys Asn Asp Lys 
200 205 

TCATCCTAGC CTTTTATCAT TCAGTACAAC TTATTTGTTT TGTCTTGCAT ATACTTACTT 859 

GTTTATTGAC TAACTTTCAA ATTGCATATT AACATTCTAG GAAGATTTAA ATTGGACTCG 919 

GGTCACAACT GAACCATATT ATTTTACACG ACAAAGCACG AGCTATTCCT AATGTAATTA 979 

AAGTTAAAAT GCTCAATAAC GTGGAATATA TACTTGTATA CAGGGGCAAC AAACTATAAA 1039 

TTGAAATATT ATTTCATATG AAGTTAACAT CTCAAAAATG TATTTAATAC TTGTAGG 1096 

CAG GGA GCT ATA GCA GAC AAA GTA TCA GCA GAT ATA TGG TCA CAC ATA 1144 
Gin Gly Ala He Ala Asp Lys Val Ser Ala Asp He Trp Ser His He 
15 10 15 

AGC AAG GAA AAT CTC ATT TTA GAA ACC GAT GAG TTG GTT GGA ATT GAT 1192 
Ser Lys Glu Asn Leu He Leu Glu Thr Asp Glu Leu Val Gly He Asp 
20 25 30 



GAT CAC ATA ACA GCC GTA TTA GAA AAA TTG AGT 
Asp His He Thr Ala Val Leu Glu Lys Leu Ser 
35 40 



TTA GAC 
Leu Asp 
45 



TCT GAA AAT 
Ser Glu Asn 



1240 
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GTG ACA ATG GTC GGC CTT TAT GGT ATG GGT GGA ATA GGC AAG ACG ACC 128 B 

Val Thr Met Val Gly Leu Tyr Gly Met Gly Gly He Gly Lys Thr Thr 
50 55 60 

ACT GCA AAG GCG GTT TAT AAC AAG ATT TCT TCT TGT TTC GAT TGT TGT 1336 
Thr Ala Lys Ala Val Tyr Asn Lys He Ser Ser Cys Phe Asp Cys Cys 
65 70 75 80 

TGT TTT ATT GAC AAC ATA CGT GAA ACA CAA GAG AAG GAT GGC GTT GTT 1384 
Cys Phe He Asp Asn He Arg Glu Thr Gin Glu Lys Asp Gly Val Val 

85 90 95 

GTT TTG CAA AAG AAG CTA GTA TCT GAA ATT TTG AGG ATC GAT TCG GGT 1432 
Val Leu Gin Lys Lys Leu Val Ser Glu He Leu Arg He Asp Ser Gly 
100 105 110 

TCG GTT GGA TTT AAT AAT GAT AGT GGC GGA CGG AAG ACG ATA AAG GAG 1480 
Ser Val Gly Phe Asn Asn Asp Ser Gly Gly Arg Lys Thr He Lys Glu 
115 - 120 125 

AGA GTT TCG AGG TTC AAA ATT CTT GTC GTT CTC GAT GAT GTG GAT GAG 1528 
Arg Val Ser Arg Phe Lys He Leu Val Val Leu Asp Asp Val Asp Glu 
130 135 140 

AAG TTT AAA TTT GAA GAT ATG TTG GGA AGT CCT AAA GAT TTT ATT TCT 1576 
Lys Phe Lys Phe Glu Asp Met Leu Gly Ser Pro Lys Asp Phe He Ser 
145 150 155 160 

CAA AGT AGA TTC ATT ATT ACT TCA AGA AGT ATG AGA GTT TTG GGT ACT 1624 
Gin Ser Arg Phe He He Thr Ser Arg Ser Met Arg Val Leu Gly Thr 
165 170 175 

TTG AAT GAG AAT CAA TGC AAG TTG TAT GAA GTT GGA TCG ATG AGC AAA 1672 
Leu Asn Glu Asn Gin Cys Lys Leu Tyr Glu Val Gly Ser Met Ser Lys 
180 185 190 

CCA CGT TCG CTT GAA CTC TTC TCC AAG CAT GCA TTC AAA AAG AAT ACG 1720 
Pro Arg Ser Leu Glu Leu Phe Ser Lys His Ala Phe Lys Lys Asn Thr 
195 200 205 

CCT CCA TCG TAT TAT GAG ACT CTA GCA AAT GAC GTC GTA GAT ACT ACA 1768 
Pro Pro Ser Tyr Tyr Glu Thr Leu Ala Asn Asp Val Val Asp Thr Thr 
210 215 220 

GCA GGA CTT CCA TTG ACT CTG AAG GTT ATA GGA TCG CTT TTA TTT AAA 1816 
Ala Gly Leu Pro Leu Thr Leu Lys Val He Gly Ser Leu Leu Phe Lys 
225 230 235 240 

CAA GAG ATT GCG GTT TGG GAA GAC ACG TTG GAA CAA TTA CGT AGA ACA 1864 
Gin Glu He Ala Val Trp Glu Asp Thr Leu Glu Gin Leu Arg Arg Thr 
245 250 255 

CTT AAC CTT GAT GAG GTT TAT GAT AGG CTA AAA ATA AGT TAT GAT GCG 1912 
Leu Asn Leu Asp Glu Val Tyr Asp Arg Leu Lys He Ser Tyr Asp Ala 
260 265 270 



i 
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TTC AAC CCG GAG GCA AAA GAG ATT TTC TTG GAT ATA GCT TGC TTC TTC I960 
Leu Asn Pro Glu Ala Lys Glu lie Phe Leu Asp lie Ala Cys Phe Phe 
275 280 285 

ATC GGA CAA AAT AAA GAA GAA CCG TAT TAC ATG TGG ACC GAC TGT AAT 2008 
He Gly Gin Asn Lys Glu Glu Pro Tyr Tyr Met Trp Thr Asp Cys Asn 
290 295 300 

TTT TAT CCA GCA AGT AAT ATT ATT TTT CTC ATT CAA AGA TGT ATG ATA 2056 
Phe Tyr Pro Ala Ser Asn He He Phe Leu He Gin Arg Cys Met He 
305 310 315 320 

CAA GTT GGG GAT GAT GAT GAG TTT AAA ATG CAC GAC CAA CTT AGA GAT 2104 
Gin Val Gly Asp Asp Asp Glu Phe Lys Met His Asp Gin Leu Arg Asp 
325 330 335 

ATG GGT AGA GAA ATT GTG AGA CGA GAG GAT GTA CTG CCG TGG AAG AGA 2152 
Met Gly Arg Glu He Val Arg Arg Glu Asp Val Leu Pro Trp Lys Arg 
340 345 350 



AGT AGA ATA TGG TCG GCA GAA GAA GGG ATT GAT CTC TTG CTG AAC AAA 2200 
Ser Arg He Trp Ser Ala Glu Glu Gly He Asp Leu Leu Leu Asn Lys 
355 360 365 

AAG GTATTCAGTT TTATTTAAAA TTAAATATTC ATATATTATT ACACATACTT 2253 
Lys 



TAAATCACAT AACTCATTAC GTTCCTTCTC CAATTACAG GGA TCA AGT AAA GTA 2307 

Gly Ser Ser Lys Val 
1 5 

AAA GCA ATT AGC ATA CCC TGG GGT GTC AAG TAT GAG TTT AAG AGC GAA 2355 
Lys Ala He Ser He Pro Trp Gly Val Lys Tyr Glu Phe Lys Ser Glu 

10 15 20 

TGT TTC TTG AAT TTG TCA GAG TTG AGA TAC CTC CAT GCA AGG GAA GCC 2403 
Cys Phe Leu Asn Leu Ser Glu Leu Arg Tyr Leu His Ala Arg Glu Ala 
25 30 35 

ATG CTT ACC GGA GAT TTC AAC AAT CTT CTC CCG AAT TTA AAG TGG CTT 2451 
Met Leu Thr Gly Asp Phe Asn Asn Leu Leu Pro Asn Leu Lys Trp Leu 
40 45 50 

GAG TTG CCA TTT TAC AAA CAT GGA GAG GAT GAT CCT CCT TTG ACC AAT 2499 
Glu Leu Pro Phe Tyr Lys His Gly Glu Asp Asp Pro Pro Leu Thr Asn 
55 4 60 65 

TAT ACC ATG AAA AAT CTG ATA ATT GTT ATT CTT GAG CAT AGC CAC ATA 2547 
Tyr Thr Met Lys Asn Leu He He Val He Leu Glu His Ser His He 
70 75 80 85 



ACG GCT GAT GAT TGG GGA GGT TGG AGG CAT ATG ATG AAG GTGTGTTGTT 
Thr Ala Asp Asp Trp Gly Gly Trp Arg His Met Met Lys 

90 95 



2596 
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TTTCAGCTGT TCATATGAAG GTGTGTTATC TTCTTATTTG TTCTTCATAT TTCTGTTTTA 2656 

ATCTG CTGTC AG ATG GCT GAG AGG CTG AAA GTT GTA CGA CTT GCT TCA 2704 
Met Ala Glu Arg Leu Lys Val Val Arg Leu Ala Ser 
15 10 

AAC TAT AGT TTG TAC GGA AGA CGT GTT CGC CTT TCT GAC TGT TGG CGC 2752 
Asn Tyr Ser Leu Tyr Gly Arg Arg Val Arg Leu Ser Asp Cys Trp Arg 
15 20 25 

TTC CCC AAA AGC ATT GAG GTA TTA TCC ATG ACT GCG ATA GAA ATG GAT 2800 
Phe Pro Lys Ser lie Glu Val Leu Ser Met Thr Ala He Glu Met Asp 
30 35 40 

GAA GTT GAT ATT GGG GAG TTA AAG AAG CTA AAG ACG TTG GTT CTG AAA 264 8 

Glu Val Asp He Gly Glu Leu Lys Lys Leu Lys Thr Leu Val Leu Lys 
45 50 55 60 

TTC TGT CCA ATA CAA AAG ATA AGT GGG GGA ACC TTT GGT ATG TTG AAG 2896 
Phe Cys Pro He Gin Lys He Ser Gly Gly Thr Phe Gly Met Leu Lys 

65 70 75 

GGA CTT CGA GAG CTT TGT CTC GAA TTC AAC TGG GGG ACA AAT TTG AGA 2944 
Gly Leu Arg Glu Leu Cys Leu Glu Phe Asn Trp Gly Thr Asn Leu Arg 
80 85 90 

GAG GTA GTT GCC GAT ATT GGT CAA CTT TCA TCT CTC AAA GTC TTG AAA 2992 
Glu Val Val Ala Asp He Gly Gin Leu Ser Ser Leu Lys Val Leu Lys 
95 100 105 

ACA ACC GGA GCT AAG GAG GTC GAG ATT AAT GAA TTT CCA TTA GGT TTG 3040 
Thr Thr Gly Ala Lys Glu Val Glu He Asn Glu Phe Pro Leu Gly Leu 
110 115 120 

AAG GAG TTA TCC ACT TCA TCT CGG ATT CCG AAT CTT TCA CAG TTG TTG 3088 
Lys Glu Leu Ser Thr Ser Ser Arg lie Pro Asn Leu Ser Gin Leu Leu 
125 130 135 140 

GAT TTG GAG GTA CTG AAG GTT TAT GAT TGC AAG GAT GGA TTT GAC ATG 3136 
Asp Leu Glu Val Leu Lys Val Tyr Asp Cys Lys Asp Gly phe Asp Met 
145 150 155 

CCT* CCT GCT AGT CCG AGT GAA GAT GAA AGT AGT GTG TGG TGG AAG GTA 3184 
Pro Pro Ala Ser Pro Ser Glu Asp Glu Ser Ser Val Trp Trp Lys Val 
160 165 170 

TCC AAG TTG AAG TCT TTG CAA CTC GAG AAG ACA AGA ATC AAT GTC AAC 3232 
Ser Lys Leu Lys Ser Leu Gin Leu Glu Lys Thr Arg lie Asn Val Asn 
175 180 IBS 

GTT GTG GAT GAT GCT TCT TCC GGT GGT CAC CTC CCT CGT TAC TTA CTA 3280 
Val Val Asp Asp Ala Ser Ser Gly Gly His Leu Pro Arg Tyr Leu Leu 
190 195 200 
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CCA ACA TCC CTA ACC TAT CTT AAA ATT TAT CAG TGT ACA GAA CCA ACG 3328 
Pro Thr Ser Leu Thr Tyr Leu Lys He Tyr Gin Cys Thr Glu Pro Thr 
205 210 215 220 

TGG CTT CCA GGA ATA GAG AAC TTG GAG AAT TTG ACT TCG CTG GAA GTC 3376 
Trp Leu Pro Gly He Glu Asn Leu Glu Aan Leu Thr Ser Leu Glu Val 
225 230 235 

AAC GAC ATC TTC CAA ACT CTT GGA GGT GAC TTG GAT GGG CTA CAA GGG 3424 
Asn Asp He Phe Gin Thr Leu Gly Gly Asp Leu Asp Gly Leu Gin Gly 
240 245 250 

TTG AGA TCA^TTG GAA ATT CTT AGG ATT CGG AAA GTA AAT GGT TTA GCT 3472 
Leu Arg Ser Leu Glu He Leu Arg He Arg Lys Val Asn Gly Leu Ala 
255 260 265 

CGG ATC AAA GGG CTT AAG GAT CTC TTG TGT TCT TCT ACC TGC AAG TTG 3520 
Arg He Lys Gly Leu Lys Asp Leu Leu Cys Ser Ser Thr Cys Lys Leu 
270 275 280 

CGG AAA TTT TAT ATT ACA GAA TGC CCC GAC CTC ATT GAG TTA CTC CCA 3568 
Arg Lys Phe Tyr He Thr Glu Cys Pro Asp Leu He Glu Leu Leu Pro . 
285 290 295 300 

TGC GAA CTC GGC GGC CAA ACA GTA GTA GTC CCC TCT ATG GCA GAA CTG 3616 
Cys Glu Leu Gly Gly Gin Thr Val Val Val Pro Ser Met Ala Glu Leu 
305 310 315 

ACC ATT AGG GAT TGT CCA CGG CTG GAG GTT GGC CCG ATG ATA AGA TCA 3664 
Thr He Arg Asp Cys Pro Arg Leu Glu Val Gly Pro Met He Arg Ser 
320 325 330 

CTC CCA AAG TTC CCA ATG CTA AAG AAG TTG GAC CTC GCG GTG GCA AAT 3712 
Leu Pro Lys Phe Pro Met Leu Lys Lys Leu Asp Leu Ala Val Ala Asn 
335 340 345 

ATA ACT AAA GAG GAG GAT CTG GAT GCG ATT GGA TCC CTA GAA GAG TTG 3760 
He Thr Lys Glu Glu Asp Leu Asp Ala He Gly Ser Leu Glu Glu Leu 
350 355 360 

GTT AGT TTG GAG TTA GAG TTA GAC GAT ACA TCT TCC GGT ATA GAG AGG 3808 
Val Ser Leu Glu Leu Glu Leu Asp Asp Thr Ser Ser Gly He Glu Arg 
365 370 375 380 

ATA GTT TCC TCT TCG AAG CTG CAA AAG TTA ACT ACA CTC GTA GTG AAG 3856 
He Val Ser Ser Ser Lys Leu Gin Lys Leu Thr Thr Leu Val Val Lys 
385 390 395 

GTG CCG AGT TTG CGG GAG ATT GAA GGG CTT GAA GAG TTG AAG TCT TTA 3904 
Val Pro Ser Leu Arg Glu He Glu Gly Leu Glu Glu Leu Lys Ser Leu 
400 405 410 

CAA GAT TTG TAT CTA GAG GGT TGC ACG TCG TTG GGG AGA CTA CCA CTG 3952 
Gin Asp Leu Tyr Leu Glu Gly Cys Thr Ser Leu Gly Arg Leu Pro Leu 
415 420 425 
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GAG AAG CTG AAG GAG CTA GAC ATT GGA GGA TGC CCT GAC CTC ACT GAG 4000 
Glu Lys Leu Lys Glu Leu Asp He Gly Gly Cys Pro Asp Leu Thr Glu 
430 435 440 



TTA GTC CAA ACA GTA GTA GGA GTC CCC TCT TTG AGA GGA CTG ACC ATT 
Leu Val Gin Thr Val Val Ala Val Pro Ser Leu Arg Gly Leu Thr He 
445 450 455 460 



404B 



AGG GAT TGT CCA CGG CTG GAG GTT GGT CCA ATG ATA CAA TCT CTT CCA 
Arg Asp Cys Pro Arg Leu Glu Val Gly Pro Met He Gin Ser Leu Pro 
465 470 475 



4096 



AAG TTC CCA ATG CTA AAT GAA TTG ACG CTC TOG ATG GTA AAT ATC ACT 
Lys Phe Pro Met Leu Asn Glu Leu Thr Leu Ser Met Val Asn He Thr 
480 465 490 



4144 



AAG GAG GAT GAG CTG GAG GTG CTT GGA TCC CTA GAA GAG TTG GAT AGT 4192 
Lys Glu Asp Glu Leu Glu Val Leu Gly Ser Leu Glu Glu Leu Asp Ser 
495 500 505 

TTG GAG TTA ACG TTA GAC GAT ACA TGT TCC AGC ATA GAG AGG ATA TCT 4240 
Leu Glu Leu Thr Leu Asp Asp Thr Cys Ser Ser He Glu Arg He Ser 
510 515 520 

TTC TTG TCG AAG CTG CAA AAG TTA ACT ACA CTC ATA GTG GAG GTG CCG 4288 
Phe Leu Ser Lys Leu Gin Lys Leu Thr Thr Leu He Val Glu Val Pro 
525 530 535 540 



AGT TTG CGG GAG ATT GAA GGT CTT GCA GAG TTG AAG TCT TTA CGA ATT 4336 
Ser Leu Arg Glu He Glu Gly Leu Ala Glu Leu Lys Ser Leu Arg He 
545 550 555 



TTG TAT CTA GAA GGA TGC ACG TCG TTG GAA AGA CTG TGG CCT GAT CAA 4384 
Leu Tyr Leu Glu Gly Cys Thr Ser Leu Glu Arg Leu Trp Pro Asp Gin 
560 565 570 

CAA CAG TTG GGT AGT CTG AAG AAC CTG AAT GTG CTC GAC ATC CAA GGT 4432 
Gin Gin Leu Gly Ser Leu Lys Asn Leu Asn Val Leu Asp He Gin Gly 
575 580 585 

TGT AAA AGC TTG AGT GTT GAC CAT CTC TCT GCA CTC AAG ACC ACT CTA 4480 
Cys Lys Ser Leu Ser Val Asp His Leu Ser Ala Leu Lys Thr Thr Leu 
590 595 600 



CCG CCC AGG GCG AGG ATA ACA TGG CCC GAT CAG CCC TAC AGA TGACGGTAGG 4532 
Pro Pro Arg Ala Arg He Thr Trp Pro Asp Gin Pro Tyr Arg 
605 610 615 

AATTAATGCA AGTGACAGTG ATGACATAGT TGTGATGGCT TGCACCTGAT CAACCTTGTT . 4592 

CTTCATGGGG TTTTCTCCCC AGTGAGATCT TAATATCCAA AATTCTGGTT TGTTCAGAGG 4652 

TTATATGGTT CAGTTTTTCA C CAT AAT ATT TTCTACGGCA TAGCGCAAAC TACTTCTAGT 4712 



ATATAGATAT AGGCAATATA TATAACATCC AACTCTGTTT TACTCACTCC TCTCATCTTC 



47.72 
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TCACTCGATT ATGTTCCATT TCTAAAATCC ATTATTCATC GCCTTATATT CATAATTATG 4832 
TAATTATTT 4841 



(2) INFORMATION FOR SEQ ID NO:2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ser Tyr Leu Arg Glu Val Ala Thr Ala Val Ala Leu Leu Leu Pro 
15 10 15 

Phe He Leu Leu Asn Lys Phe Trp Arg Pro Asn Ser Lys Asp Ser He 
20 25 30 

Val Asn Asp Asp Asp Asp Ser Thr Ser Glu Val Asp Ala He Ser Asp 
35 40 45 

Ser Thr Asn Pro Ser Gly Ser Phe Pro Ser Val Glu Tyr Glu Val Phe 
50 55 . 60 

Leu Ser Phe Arg Gly Pro Asp Thr Arg Glu Gin Phe Thr Asp Phe Leu 
65 70 75 BO 

Tyr Gin Ser Leu Arg Arg Tyr Lys He His Thr Phe Arg Asp Asp Asp 

85 90 95 

Glu Leu Leu Lys Gly Lys Glu He Gly Pro Asn Leu Leu Arg Ala He 
100 105 110 

Asp Gin Ser Lys He Tyr Val Pro He He Ser Ser Gly Tyr Ala Asp 
115 120 125 

Ser Lys Trp Cys Leu Met Glu Leu Ala Glu He Val Arg Arg Gin Glu 
130 135 140 

Glu Asp Pro Arg Arg He He Leu Pro lie Phe Tyr Met Val Asp Pro 
145 150 155 160 



Ser Asp Val Arg His Gin Thr Gly Cys Tyr Lys Lys Ala Phe Arg Lys 
165 170 175 



His Ala Asn Lys Phe Asp Gly Gin Thr He Gin Asn Trp Lys Asp Ala 
180 185 190 
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Leu Lys Lys Val Gly Asp Leu Lys Gly Trp His lie Gly Lys Asn Asp 
195 200 205 

Lys 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLE COLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Gin Gly Ala He Ala Asp Lys Val Ser Ala Asp He Trp Ser His He 
1 5 10 15 

Ser Lys Glu Asn Leu He Leu Glu Thr Asp Glu Leu Val Gly lie Asp 
20 25 30 

Asp His He Thr Ala Val Leu Glu Lys Leu Ser Leu Asp Ser Glu Asn 
35 40 45 

Val Thr Met Val Gly Leu Tyr Gly Met Gly Gly He Gly Lys Thr Thr 
50 55 60 

Thr Ala Lys Ala Val Tyr Asn Lys He Ser Ser Cys Phe Asp Cys Cys 
65 70 75 80 

Cys Phe He Asp Asn He Arg Glu Thr Gin Glu Lys Asp Gly Val Val 

65 90 95 

Val Leu Gin Lys Lys Leu Val Ser Glu He Leu Arg He Asp Ser Gly 
100 105 110 

Ser Val Gly Phe Asn Asn Asp Ser Gly Gly Arg Lys Thr He Lys Glu 
115 120 125 

Arg Val Ser Arg Phe Lys He Leu Val Val Leu Asp Asp Val Asp Glu 
130 135 140 

Lys Phe Lys Phe Glu Asp Met Leu Gly Ser Pro Lys Asp Phe He Ser 
145 150 155 160 

Gin Ser Arg Phe He He Thr Ser Arg Ser Met Arg Val Leu Gly Thr 
165 170 175 



Leu Asn Glu Asn Gin Cys Lys Leu Tyr Glu Val Gly Ser Met Ser Lys 
180 185 190 
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Pro Arg Ser Leu Glu Leu Phe Ser Lys His Aia Phe Lys Lye Asn Thr 
195 200 205 

Pro Pro Ser Tyr Tyr Glu Thr Leu Ala Asn Asp Val Val Asp Thr Thr 
210 215 220 

Ala Gly Leu Pro Leu Thr Leu Lys Val He Gly Ser Leu Leu Phe Lys 
225 230 235 240 

Gin Glu He Ala Val Trp Glu Asp Thr Leu Glu Gin Leu Arg Arg Thr 
245 250 255 

Leu Asn Leu Asp Glu Val Tyr Asp Arg Leu Lys He Ser Tyr Asp Ala 
260 265 270 

Leu Asn Pro Glu Ala Lys Glu He Phe Leu Asp lie Ala Cys Phe Phe 
275 2B0 285 

He Gly Gin Asn Lys Glu Glu Pro Tyr Tyr Met Trp Thr Asp Cys Asn 
290 295 300 

Phe Tyr Pro Ala Ser Asn He He Phe Leu He Gin Arg Cys Met He 
305 310 315 320 



Gin Val Gly Asp Asp Asp Glu Phe Lys Met His Asp Gin Leu Arg Asp 
325 330 335 



Met Gly Arg Glu 
340 

Ser Arg He Trp 
355 



He Val Arg Arg 



Ser Ala Glu Glu 
360 



Glu Asp Val Leu 
345 

Gly He Asp Leu 



Pro Trp Lys Arg 
350 

Leu Leu Asn Lys 
365 



Lys 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Ser Ser Lys Val Lys Ala He Ser He Pro Trp Gly Val Lya Tyr- 
15 10 15 



Glu Phe Lys Ser Glu Cys Phe Leu Asn Leu Ser Glu Leu Arg Tyr Leu 
20 25 30 
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His Ala Arg Glu Ala Met Leu Thr Gly Asp Phe Asn Asn Leu Leu Pro 
35 40 45 

Asn Leu Lys Trp Leu Glu Leu Pro Phe Tyr Lys His Gly Glu Asp Asp 
50 55 €0 

Pro Pro Leu Thr Asn Tyr Thr Met Lys Asn Leu lie lie Val lie Leu 
65 70 75 80 

Glu His Ser His He Thr Ala Asp Asp Trp Gly Gly Trp Arg His Met 

85 90 95 

Met Lys 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Met Ala Glu Arg Leu Lys Val Val Arg Leu Ala Ser Asn Tyr Ser Leu 
15 10 15 

Tyr Gly Arg Arg Val Arg Leu Ser Asp Cys Trp Arg Phe Pro Lys Ser 
20 25 30 

He Glu Val Leu Ser Met Thr Ala He Glu Met Asp Glu Val Asp He 
35 40 45 

Gly Glu Leu Lys Lys Leu Lys Thr Leu Val Leu Lys Phe Cys Pro He 
50 55 60 

Gin Lys He Ser Gly Gly Thr Phe Gly Met Leu Lys Gly Leu Arg Glu 
65 70 75 80 

Leu Cys Leu Glu Phe Asn Trp Gly Thr Asn Leu Arg Glu Val Val Ala 

85 90 95 

Asp He Gly Gin Leu Ser Ser Leu Lys Val Leu Lys Thr Thr Gly Ala 
100 105 110 

Lys Glu Val Glu He Asn Glu Phe Pro Leu Gly Leu Lys Glu Leu Ser 
115 120 125 

Thr Ser Ser Arg He Pro Asn Leu Ser Gin Leu Leu Asp Leu Glu Val 
130 135 140 

Leu Lys Val Tyr Asp Cys Lys Asp Gly Phe *sp Met Pro Pro Ala Ser 
145 150 155 160 
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Pro Ser Glu Asp Glu Ser Ser Val Trp Trp Lys Val Ser Lys Leu Lys 
165 170 175 

Ser Leu Gin Leu Glu Lys Thr Arg He Asn Val Asn Val Val Asp Asp 
180 IBS 190 

Ala Ser Ser Gly Gly His Leu Pro Arg Tyr Leu Leu Pro Thr Ser Leu 
195 200 205 

Thr Tyr Leu Lys He Tyr Gin Cys Thr Glu Pro Thr Trp Leu Pro Gly 
210 215 220 

He Glu Asn Leu Glu Asn Leu Thr Ser Leu Glu Val Asn Asp He Phe 
225 230 235 240 

Gin Thr Leu Gly Gly Asp Leu Asp Gly Leu Gin Gly Leu Arg Ser Leu 
245 250 255 

Glu He Leu Arg He Arg Lys Val Asn Gly Leu Ala Arg He Lys Gly 
260 265 270 

Leu Lys Asp Leu Leu Cys Ser Ser Thr Cys Lys Leu Arg Lys Phe Tyr 
275 280 285 

He Thr Glu Cys Pro Asp Leu He Glu Leu Leu Pro Cys Glu Leu Gly 
290 295 300 

Gly Gin Thr Val Val Val Pro Ser Met Ala Glu Leu Thr He Arg Asp 
305 310 315 320 

Cys Pro Arg Leu Glu Val Gly Pro Met lie Arg Ser Leu Pro Lys Phe 
325 330 335 

Pro Met Leu Lys Lys Leu Asp Leu Ala Val Ala Asn He Thr Lys Glu 
340 345 350 

Glu Asp Leu Asp Ala He Gly Ser Leu Glu Glu Leu Val Ser Leu Glu 
355 360 365 

Leu Glu Leu Asp Asp Thr Ser Ser Gly He Glu Arg He Val Ser Ser 
370 375 380 

Ser Lys Leu Gin Lys Leu Thr Thr Leu Val Val Lys Val Pro Ser Leu 
385 390 395 400 

Arg Glu He Glu Gly Leu Glu Glu Leu Lys Ser Leu Gin Asp Leu Tyr 
405 410 15 

Leu Glu Gly Cys Thr Ser Leu Gly Arg Leu Pro Leu Glu Lys Leu Lys 
420 425 430 



Glu Leu Asp He Gly Gly Cys Pro Asp Leu Thr Glu Leu Val Gin Thr 
435 440 445 
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Val Val Ala Val Pro Ser Leu Arg Gly Leu Thr He Arg Asp Cys Pro 
450 455 460 

Arg Leu Glu Val Gly Pro Met He Gin Ser Leu Pro Lys Phe Pro Met 
465 470 475 480 

Leu Asn Glu Leu Thr Leu Ser Met Val Asn He Thr Lys Glu Asp Glu 
485 490 495 

Leu Glu Val Leu Gly Ser Leu Glu Glu Leu Asp Ser Leu Glu Leu Thr 
500 505 510 

Leu Asp Asp Thr Cys Ser Ser lie Glu Arg He Ser Phe Leu Ser Lys 
515 520 525 

Leu Gin Lys Leu Thr Thr Leu He Val Glu Val Pro Ser Leu Arg Glu 
530 535 540 

He Glu Gly Leu Ala Glu Leu Lys Ser Leu Arg He Leu Tyr Leu Glu 
545 550 555 560 

Gly Cys Thr Ser Leu Glu. Arg Leu Trp Pro Asp Gin Gin Gin Leu Gly 
565 570 575 



Ser Leu Lys Asn Leu Asn Val Leu Asp He Gin Gly Cys Lys Ser Leu 
580 585 590 

Ser Val Asp His Leu Ser Ala Leu Lys Thr Thr Leu Pro Pro Arg Ala 
595 600 605 



Arg He Thr Trp Pro Asp Gin Pro Tyr Arg 
610 615 
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(2) INFORMATION FOR SEQ ID NO:G: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
{B> LOCATION: 1..90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTG TGT TGT TTT TCA GCT GTT CAT ATG AAG GTG TGT TAT CTT CTT ATT 48 
Val Cys Cys Phe Ser Ala Val His Met Lys Val Cys Tyr Leu Leu He 
1 5 10 15 

TGT TCT TCA TAT TTC TGT TTT AAT CTG CTG TCA GAT GGC TG 90 
Cys Ser Ser Tyr Phe Cys Phe Asn Leu Leu Ser Asp Gly 

20 25 30 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Val Cys Cys Phe Ser Ala Val His Met Lys Val Cys Tyr Leu Leu He 
X 5 10 15 

Cys Ser Ser Tyr Phe Cys Phe Asn Leu Leu Ser Asp Gly 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
(xi(i>) /: Alternative amino acid 



Gly Xaa Xaa Xaa Xaa Gly Lya Thr / Ser 
1 S 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Met Gly Gly He Gly Lys Thr Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Gly Leu Tyr Gly Met Gly Gly He Gly Lys Thr Thr 
15 10 
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(2) INFORMATION FOR SEQ ID NO: II: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



AAAAAAAAAT 
ATCTCATCCT 
CACTAACCAC 
CCTTGATCAT 
AACCGGTCCG 
CTGACTTGTG 
TTGAATTTGT 
TCAGAAGTAA 
GCAAATTGAA 
ATCCTTCAOA 
CTACTGCTGT 
CCAAAGACTC 
ACTCCACAAA 
GGGGTCCAGA 



TATAAGTTAA 
TCACTCCATT 
TTCAAAATAG 
GTAGCCAATT 
ATTTGAAATT 
GGCCTGCATT 
GGAGCAATAA 
GGGATAAGGC 
GCAGGCAAGA 
TCATTTCTGC 
TGCCTTGCTT 
AATCGTCAAC 
TCCCTCTGGT 
TACTCGTGAA 



CAATTATACG 
TTAGCAATTG 
AAAAATAATG 
AGGTGATTCT 
TGACCCAGGT 
ATTTGTAGAT 
TTGAGTTTTT 
AAGAAGCAGA 
ACAGTTACTG 
TCAATTGAAT 
CTCCCTTTCA 
GATGATGACG 
TCATTTCCCT 
CAGTTCACCG 



TATGTTCCTA 
TTAAAATTTT 
CGTAGCACAC 
TGTGGACAAG 
TTCGAAATTC 
AAGAAAGAGT 
GAATGGAGGA 
GGAGCAGAGA 
GAAATTCATT 
CACTAGTCAT 
TTCTTCTCAA 
ATTCAACATC 
CCGTGGAGTA 
ATTTCCTATA 



ACAATTGCTA 
GGAATCGATA 
AAACAACAGG 
TAGAACTAGT 
TAACCAACCC 
TTACAAATGG 
TACTGAAAGC 
GAAGAACCAG 
TATCTCTGCT 
GAGTTATTTG 
CAAGTTTTGG 
TGAAGTTGAT 
TGAAGTGTTT 
TCAGTCTCTC 



AATAAATATT 60 

AATCTCGATC 120 

ATTGGTGGCC 180 

TGTACTCGCT 240 

AAATGGTTGA 300 

TTGGCTTACA 360 

CAGTGGGAGC 420 

CAAAGCACAT 480 

TTCAATTTCT 540 

AGAGAAGTTG 600 

AGACCAAATT 660 

GCCATATCCG 720 

TTGAGTTTCA 780 

CGTCGCTATA 840 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 



Pro Asp Leu Thr Glu Leu Val Gin Thr Val Val Ala Val Pro Ser Leu 
15 10 15 

Arg Gly Leu Thr lie Arg Asp Cys Pro Arg Leu Glu Val Gly Pro Met 
20 25 30 

lie Gin Ser Leu Pro Lys Phe Pro Met Leu Asn Glu Leu Thr Leu Ser 
35 40 45 

Met Val Asn lie Thr Lys Glu Asp Glu Leu Glu Val Leu Gly Ser Leu 
50 55 60 

Glu Glu Leu Asp Ser Leu Glu Leu Thr Leu Asp Asp Thr Cys Ser Ser 
65 70 75 80 

He Glu Arg He Ser Phe Leu Ser Lys Leu Gin Lys Leu Thr Thr Leu 
85 90 95 

He Val Glu Val Pro Ser Leu Arg Glu He Glu Gly Leu Ala Glu Leu 
100 105 110 

Lys Ser Leu Arg He Leu Tyr Leu Glu Gly Cys Thr Ser Leu Glu Arg 
115 120 125 

Leu Trp Pro Asp Gin Gin Gin Leu Gly Ser Leu Lys Asn Leu Asn Val 
130 135 140 



Leu Asp He Gin Gly Cys 
145 150 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



He Leu Val Val Leu Asp Asp Val Asp 

1 5 
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CLAIMS 

1. An isolated nucleic acid molecule comprising a sequence of nucleotides which 
confers or otherwise facilitates disease resistance in plants. 

2. An isolated nucleic acid molecule according to claim 1 wherein the disease 
resistance is rust resistance. . 



3. An isolated nucleic aid molecule according to claim 2 comprising a sequence of 
nucleotides as set forth in SEQ ID NO:l or having at least 45% similarity to all or 
part thereof. 

4. An isolated nucleic acid molecule according to claim 2 comprising a sequence of 
nucleotides encoding or complementary to a sequence of nucleotides encoding the 
amino acid sequence set forth in SEQ ID NOs:2 to 5 or having at least 45% 
similarity to all or part thereof. 

5. An isolated nucleic acid molecule comprising a sequence of nucleotides encoding 
or complementary to a sequence encoding a peptide, polypeptide or protein which is 
capable of interacting with an avirulence gene product on a flax rust. 

6. An isolated nucleic acid molecule according to claim 5 wherein the nucleic acid 
molecule corresponds to an allele of rust resistance gene L or M. 

7. An isolated nucleic acid molecule according to claim 5 or 6 wherein the nucleic 
acid molecule corresponds to an L6 allele of rust resistance gene L and the 
corresponding avirulence gene is AL6. 

8. An isolated nucleic acid molecule according to claim 5 or 6 wherein the nucleic 
acid molecule corresponds to the L2 allele of the rust resistance gene L and the 
corresponding avirulence gene is AL2. 

9. An isolated nucleic acid molecule according to claim 5 or 6 wherein the nucleic 
acid molecule corresponds to the LI 0 allele of the rust resistance gene L and the 
corresponding avirulence gene is ALIO. 
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10. An isolated nucleic acid molecule which: 

(i) confers rust resistance in a plant; and 

(ii) is capable of hybridizing under low stringency conditions to the 
nucleic acid molecule defined in SEQ ID NO:l. 

11. An isolated nucleic acid molecule according to claim 10 wherein the nucleic 
acid molecule is capable of hybridizing to the nucleic acid defined in SEQ ID NOs:2 
to 5 under medium stringent conditions. 

12. An isolated nucleic acid molecule according to claim 10 wherein the nucleic 
acid molecule is capable to hybridizing to the nucleic acid defined in SEQ ID NO:l 1 
under high stringent conditions. 

13. An isolated nucleic acid molecule according to claim 10 or 11 or 12 having a 
nucleotide sequence substantially corresponding to the nucleotide sequence set forth 
in SEQ ID NO:l or having at least 80% similarity to all or part thereof. 

14. An isolated nucleic acid molecule according to claim 1 or 5 or 10 isolatable 
from a flax plant 

15. An isolated nucleic acid molecule according to claim 1 or 5 or 10 isolatable 
from a species of Linton or a soybean, sunflower, cereal, or wild varieties of wheat 
or barley. 

16. An isolated nucleic acid molecule according to claim IS isolatable from 
Linum marginale. 

17. A method for identifying a rust resistance genetic sequence in a plant, said 
method comprising contacting genomic DNA or cDNA or mRNA from said plant 
with a hybridizing effective amount of a genetic sequence conferring or facilitating 
rust resistance or part thereof and then detecting said hybridization. 

18. A method according to claim 17 wherein the plant is flax. 
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19. A method according to claim 17 wherein the plant is a species of Linum, 
soybean, sunflower, cereal, or a wild variety of wheat or barley. 

20. A method according to claim 17 wherein the plant is Linum marginale. 

21. A method according to claim 17 wherein the rust resistance genetic sequence 
is as set forth in SEQ ID NO:l or is a portion or part thereof encoding rust resistance 
or has at least 60% similarity thereto. 

22. A method according to claim 17 wherein the rust resistance genetic sequence 
encodes an amino acid sequence substantially as set forth in SEQ ID NOs:2 to 5 or 
having at least 60% similarity thereto. 

23. An isolated nucleic acid molecule comprising a sequence of nucleotides 
which: 

(i) confers or otherwise facilitates disease resistance in plants; and 

(ii) encodes a product having at least one leucine rich region and/or 
comprises a p-loop at the 5* end of the nucleic acid molecule which 
encodes the amino acid sequence: 

GXXXXGKT/S, 
where X is any amino acid residue. 

24. An isolated nucleic acid molecule according to claim 23 wherein the leucine 
rich region is a leucine rich repeat encoded by the 3* end of the molecule. 

25. A nucleic acid molecule according to claim 23 or 24 wherein the leucine 
region comprises the amino acid sequence: 

PDLIELLPCELGGQTW . VPSMAELTIRDCPRLEVGPMIRSLPKFPMLKK 
LDIAVANITKEEDLDAIGSLEELVSLELELDDTSSGIERIVSSSKLQKLT 

TLVVKVPSLREIEGLEELKSLQDLYLEGCTSLGRL PLEKLKE 

LDIGGC 

or having at least 60% similarity to all or part thereof. 

26. A nucleic acid molecule according to claim 23 wherein the p-loop sequence 
encodes the amino acid residues: 

GMGGIGKTT 
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27. A nucleic acid molecule according to claim 23 wherein the p-loop sequence 
encodes the amino acid residues: 

GLYGMGGIGKTT 

28. A nucleic acid molecule according to claim 23 wherein the disease resistance 
is rust resistance. 

29. A nucleic acid molecule according to claim 28 wherein the plant is a crop 
plant 

30. A nucleic acid molecule according to claim 29 wherein the plant is wheat or 
barley. 

31. A nucleic acid molecule according to claim 23 comprising a nucleotide 
sequence as set forth in SEQ ID NO:l or encoding an amino acid sequence as set 
forth in SEQ ID NOs:2 to 5 or having at least 60% similarity to all or part thereof 

32. A modular resistance gene comprising heterologous genetic sequences from at 
least two nucleic acid molecules encoding disease resistance in plants. 

33. A modular resistance gene according to claim 32 wherein the disease 
resistance is rust resistance. 

34. A modular resistance gene according to claim 33 comprising all or part of 
genetic sequences encoding leucine rich regions from alleles of at least two rust 
resistance genes selected from L and M. 

35. A modular resistance gene according to claim 34 comprising all or part of 
genetic sequences encoding leucine rich regions from alleles of at least two rust 
resistance genes selected from L6, L2 and L10. 

36. A transgenic plant carrying a non-indigenous genetic sequence conferring or 
otherwise facilitating rust resistance in said plant 

37. A transgenic plant according to claim 36 wherein the non-indigenous sequence 
comprises all or part of the nucleotide sequence set forth in SEQ ID NO:l or encodes 
an amino acid sequence as set forth in SEQ ID NOs:2 to 5 or having at least 60% 
similarity to all or part thereof. 
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38. A transgenic plant according to claim 36 or 37 wherein said plant is wheat 
barley. 
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